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One of the challenges of collider physics is to unambiguously associate detector based objects 
with the corresponding elementary physics objects. A particular example is the association of 
calorimeter-based objects such as "jets", identified with a standard (IR-safe) jet algorithm, with 
the underlying physics objects, which may be QCD-jets (arising from a scattered parton), electrons, 
photons and, as discussed here, photon-jets (a group of collinear photons). This separation is 
especially interesting in the context of Higgs searches, where the signal includes both di-photon 
(in the Standard Model) and di-photon-jet decays (in a variety of Beyond the Standard Model 
scenarios), while QCD provides an ever-present background. Here we describe the implementation 
of techniques from the rapidly evolving area of jet-substructure studies to not only enhance the more 
familiar photon-QCD separation, but also separately distinguish photon-jets, i.e., separate usual 
jets into three categories: single photons, photon-jets and QCD. The efficacy of these techniques for 
separation is illustrated through studies of simulated data. 



I. INTRODUCTION 

The Large Hadron Collider (LHC) has clearly exhib- 
ited its ability to make discoveries with the observation 
of a new resonance pQ [5] with even spin that decays to 
photons and Z bosons as expected of the Standard Model 
(SM) Higgs particle. Thus precise measurements of the 
decays of this resonance into various channels (whether 
standard or not), are of the utmost importance. At the 
same time, it is essential to verify our understanding of 
the existing channels, in particular, h — > 77. How well 
are these photons defined? Can physics objects other 
than single photons leave signatures in the detector sim- 
ilar to that of a photon? Not surprisingly, the answer 
is yes [SHS]. Given the granularity of the calorimeters, 
an object consisting of (nearly) collinear photons, typi- 
cally labeled a photon-jet, will generate a signature sim- 
ilar to that of a single photon. The possibility that the 
Higgs particle decays to multiple collinear photons is not 
new [21 E] . Simple models where the Higgs decays to al- 
most massless scalars that each in turn decay to a pair of 
photons, typically do not give rise to events with four 
separately identifiable photons, but rather to pairs of 
photon-jets, each with 2 photons. Slightly more com- 
plicated models can produce Higgs decays to photon-jets 
with 4, 6, • • • photons. We will discuss concrete models 
where the Higgs decays to photon-jets with 2 and 4 pho- 
tons per photon-jet. Thus it is essential to develop tools 
to separate single photons from photon-jcts from QCD- 
jets. Otherwise we are unlikely to understand either the 
signal or the background. 

ATLAS recently made attempts to identify photon-jets 
from Higgs decays [6^ . These analyses rely on relaxing the 
isolation/shower shape criteria, which use the differing 
distributions of energy deposition within the calorime- 
ter cells to quite successfully discriminate single photons 
from QCD-jets. Unfortunately, the parameters of the un- 
derlying model can be easily adjusted so that the resul- 



tant photon-jets pass the strictest isolation/shower shape 
criteria just like photons. More importantly, loosening 
isolation criteria results in a larger fake rate for QCD- 
jets. Discriminating photon-jets from QCD-jets is more 
challenging than separating single photons from QCD- 
jets. 

Fortunately jet substructure techniques [THT^ have re- 
cently been developed to distinguish QCD-jets from jets 
containing boosted heavy particle decays, and we can 
use this work for detection of photon-jets. More broadly, 
'jets', as defined by an infrared safe jet clustering algo- 
rithm, are being proposed as a universal language to de- 
scribe all calorimeter objects including single photons, 
photon-jets and QCD-jets. By using the tools developed 
in jet substructure physics, we do not need to rely on iso- 
lation cuts. We supplement the traditional/conventional 
variables currently used to discriminate photons from 
QCD-jets with substructure variables that probe in de- 
tail the energy distribution within the jet. Note that 
the photons-jets are composed of energetic photons dis- 
tributed inside the jet, where the distribution is a result 
of the kinematic features of the model, e.g., the masses 
and spins of the intermediate particles. The existence of 
this structure within photon-jet suggests that substruc- 
ture variables will be efficient at finding and discriminat- 
ing photon-jets. We show that our analysis is capable 
of separating photon-jets from both single photons and 
QCD-jets at least as efficiently as the traditional discrim- 
inators separate photons from QCD-jets. 

There is another important advantage to applying 
jet substructure techniques to purely electro-magnetic 
calorimeter (ECal) objects. The introduction of 'groom- 
ing' algorithms (including filtering [lOl [131 E]; prun- 
ing [Ill[in], and trimming [T7j) promised to suppress the 
undesirable contributions to purely hadronic jets from 
the underlying event (the largely uncorrelated soft inter- 
actions surrounding the interesting hard scattering) and 
from pile-up (the truly uncorrelated proton-proton colli- 
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sions that occur in the same time window). Indeed, the 
recent results from studies at ATLAS [18, and CMS [19] 
indicate this grooming is effective. We expect that this 
substructure-based grooming will work as well for all 
ECal based objects. 

It should be noted that in the context of Higgs physics, 
the decay to photon-jets is not the only example where 
the coUinearity of the decay products adds complexity to 
the analysis. CoUinearity plays a role for traditional de- 
cays of the Higgs boson when it is boosted. In Ref. HTO] , 
the authors exploited the coUinearity of the &-quarks in 
boosted Higgs decays (both quarks in a single jet) to 
greatly enhance the chances of detecting the ft, — > 66 
channel, featuring jet substructure as a mainstream tool 
(see also Refs. [ZH3)- The application of jet substruc- 
ture in Higgs physics has now become a very active area 
of research, applied both to the SM Higgs pU^E2] as 
well to beyond the SM Higgs scenarios For 
reviews, more detailed descriptions, and references see 
Refs. [MlllQ]. 

The paper is organized as follows: in Sec. |llj we start 
with a simplified model for photon-jets. We propose a set 
of benchmark points, where we take different combina- 
tions of masses and parameters in the simplified model 
to produce photon-jets displaying a variety of distinct 
kinematics. In Sec. IIIII we define the details of our simu- 
lation. We describe, in detail, how we generate samples 
of photon-jets, one for each of the benchmark points, 
QCD-jets, and single photons. We present our analy- 
sis in Sec. IIVI We describe all the variables that we use 
in this work to discriminate photon-jets from QCD-jets 
from single photons. Then we combine these variables 
in a multivariate analysis. We train boosted decision 
trees (BDTs) using the samples of jets and use these to 
optimize the discriminating power of our analyses. We 
also show how these BDTs can be used to simultaneously 
separate photon-jets, photons, and QCD-jets from each 
other. Our conclusions are presented in Sec. |V] 



II. SIMPLE MODEL FOR PHOTON-JETS 

By definition, photons-jets refer to calorimeter objects 
consisting of more than one hard photon. However, such 
a broad definition presents a challenge since all photon- 
jets are not the same. They differ in terms of the number 
of hard constituent photons as well as in the distribu- 
tion of those photons within the photon-jet. To provide 
a systematic phenomenological study of photon-jets we 
classify these objects in more detail in terms of the pro- 
duction mechanism and consider a broad range. We will 
refer to the various production scenarios as 'benchmark' 
scenarios. We find that a simple model in the spirit of 
Ref. [21] with two new particles is sufficient to charac- 
terize these benchmarks. The model includes a small 
number of interactions and we can vary the strength of 
these interaction and the new particle masses in order 
to generate the benchmark scenarios. In particular, we 



introduce two scalar fields ni and n2 of mass toi and m2 
respectively. Without loss of generality, we choose the 
naming convention such that mi > m2. Neither ni nor 
n2 carry any SM charges. We use the following interac- 
tions to generate photon-jets 

^Hhhnl + ^fii2ninl + [ — ni + — 712) Ff^^F^^ , (1) 

2 2 V»Tll TO2 / 

where fJ-fn tJ-i2 are mass parameters, 771,772 are dimension- 
less coupling constants, and F^i, is the electromagnetic 
field strength operator. 

This simple model bears a resemblance to a Higgs por- 
tal scenario [52HM| because of the fj,h coupling. In the 
Higgs portal language, rii and 712 constitute a 'hidden' 
sector while the coupling /i/i provides a tunnel to the 
corresponding 'hidden valley'. The electromagnetic cou- 
plings (proportional to the 77 parameters) provide ways 
for the new particles to decay back to SM particles, pho- 
tons in this case. With respect to Higgs physics, this 
simple model provides a realistic example where the SM 
Higgs field decays through the new particles to multiple 
photons. In the limit mi <^ mh, the resultant photons 
(the decay products of rii) are essentially coUinear. 

In Table |!] we list the benchmark scenarios (labeled 
photon-jet study points or PJSPs) that we investigate 
in this work. All are generated by varying the parame- 
ters in Eq.Q. The symbol X in Table |l] denotes that a 
non-zero value is selected for that parameter, which then 
determines the decay mode. We have chosen the bench- 
marks in such a way that the parameters denoted by X 
only change the total width of the decaying particles. As 
long as the decays are prompt, the exact values of these 
parameters are irrelevant to the phenomenological prop- 
erties of the photon-jets. In all these study points we 
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TABLE I. The study points used in our analysis. For 
PJSP 1 — 3, 772 does not participate in the decay chain since 
/7i2 = and the 7772 and 772 columns are empty. By X we de- 
note that a non-zero value is chosen for the parameter, which 
facilitates prompt decays, but the specific value plays no role. 

take the Higgs particle to decay to a pair of ni parti- 
cles. The small ni mass (ttii ^ 777^) ensures that the 
decay products of the ni are highly coUimated. In the 
Higgs particle rest frame, which is close to the labora- 
tory frame on average, each rii has momentum ~ mu/'i- 
and the typical angular separation between the ?7i decay 
products is of the order of Ami/mh- Note that, given 
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we always consider mi < 10 GeV, we expect the typical 
angular separation between the ni decay products to be 
< 1/3 (we use nih = 120 GeV). As long as the angular 
size of photon-jets is larger than 1/3, we expect to cap- 
ture all the decay products of the ni in each photon-jet 
for all the benchmark points. 

For the study points PJSP 1 — 3 the mass parameter 
^12 is set to zero and ni — ^ 77 is the only possible ni 
decay mode. Hence these scenarios are characterized by 
photons-jets with typically 2 hard photons per jet, and 
n-2 plays no role in the phenomenology (so no 77.2 mass 
or coupling values are included in the table). In these 
scenarios the Higgs particle cascade decays to four pho- 
tons {h — niTii — ?► 7777)- The precise value of mi gov- 
erns the angular separation of the two photons inside the 
photon-jets. For a very small mi, each photon-jet looks 
much like a single photon. (Of course, if the Higgs is 
highly boosted, the decay results in a single photon-jet 
containing all 4 photons.) 

For study points PJSP 4 — 8 we set rji to zero and /ii2 
to a non-zero value. In these contrasting scenarios the 
only ni decay mode involves the chain ni — >■ n2n2 
7777. Hence the Higgs decays again to two photon-jets, 
but now each photon-jet typically contains four photons 
(the ni decay products). (In this highly boosted 

Higgs yields a single photon-jet containing 8 photons.) 



III. 



SIMULATION DETAILS 



In order to generate samples of photons-jets, we imple- 
ment the simple model of Eq. ([T]) in MadGraph 5 [55] . 
For each benchmark point we generate matrix elements 
corresponding to the process — ^ /i — >■ niui (via gluon 
fusion) using MadGraph 5 with = 120 GeV, which 
we employ as input to Pythia 8.1 [3S1 in order to 
generate the full events and for the subsequent ni de- 
cays. Since the Higgs production is evaluated at lowest 
order, the produced Higgs particles have zero transverse 
momentum. We use the QCD dijet events generated by 
standalone Pythia 8.1 to provide a sample of QCD-jets. 
In order to define a sample of single photons, we also 
generate — > /i — > 77 events where the photons are 
well separated. Finally, we include initial state radiation 
(ISR), final state radiation (FSR) and multiple parton in- 
teractions (MI, i.e., the UE) as implemented in Pythia 8.1 
to simulate the relevant busy hadronic collider environ- 
ment. 

The Pythia output final states are subjected to our 
minimal detector simulation. In the following we describe 
briefly how we treat the final state particles in each event: 

• We identify all charged particles with transverse 
momentum px > 2 GeV and pseudorapidity \ri\ < 
2.5 as charged tracks. 

• In a real detector, tracks are also generated if pho- 
tons convert within the pixel part of the tracker. In 



this work, we simulate this photon conversion pro- 
cess by associating with each photon a probability 
for it to convert in the tracker. 

The probability is a function of the number of radi- 
ation lengths of material the photon has to traverse 
in order to escape the inner part of the tracker. We 
use the speciflcations of the ATLAS detector in or- 
der to model this pseudorapidity dependent prob- 
ability distribution. The details of this procedure 
are outlined in the Appendix |A 1| 

• In our simulation, all particles (except charged par- 
ticles with £^ < 0.1 GeV) reach the calorimeters, 
and all of these (except muons with pT > 0.5 GeV) 
deposit all of their energy in the calorimeters. The 
electromagnetic calorimeter (ECal) is modeled as 
cells of size 0.025 x 0.025 in {rj-cj)), whereas the 
hadronic calorimeter (HCal) is taken to have more 
coarse granularity with 0.1 x 0.1 cells. Besides pho- 
tons and electrons, soft muons and soft hadrons 
(soft means E < 0.5 GeV) are treated as deposit- 
ing all of their energy in the ECal. More energetic 
hadrons are absorbed in the HCal, while more en- 
ergetic muons escape the calorimeter. For a more 
detailed picture see Appendix |A 2| 



• We attempt to simulate the showering that occurs 
within the ECal. We distribute the energy of each 
particle that is absorbed in the ECal into a (3 x 3) 
grid of cells (centered on the direction of the orig- 
inal particle) according to a precomputed Moliere 
matrix corresponding to the Moliere radius of lead. 
For details on this transverse smearing see Ap- 
The structure induced by this shower 
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simulation is observable in our final results. 

• We implement calorimeter energy smearing for 
both the ECal and the HCal. The calorimetric re- 
sponse is parametrized through a Gaussian smear- 
ing of the accumulated cell energy E with a vari- 
ance a: 



a 

E 



S 



+ c, 



(2) 



where S and C are the stochastic and constant 
terms. For the ECal and the HCal, we use {S, C) to 
be (0.1,0.01) and (0.5,0.03), respectively, in order 
to approximately match the reported calorimeter 
response from ATLAS [55]. 

• Each calorimeter cell that passes an energy thresh- 
old becomes an input for our jet clustering algo- 
rithm. For the ECal cells we require Et > 0.1 GeV, 



^ We do not simulate the magnetic field in the detector. Conse- 
quently the e+e~ pairs from photon conversion continue in the 
direction of the photon. So for every converted photon we obtain 
effectively a single track, if the photon passes the threshold. 
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while for the HCal cells we use the somewhat harder 
cut Et > 0.5 GeV. We sum all the energy de- 
posited in a given calorimeter cell and construct a 
massless 4- vector with the 3- vector direction corre- 
sponding to the location of that cell. 

• As the final step we cluster the 4-vectors corre- 
sponding to the calorimeter cells into jets using 
Fastjet 3.0.3 [351 SO]- In particular, we use the 
anti-A;T jet clustering algorithm [U] with R = 0.4 
and and require pr > 50 GeV for every jet. Only 
the leading jet from each event is retained for fur- 
ther analysis in order to maintain independence 
among the jets in the sample. 



IV. ANALYSIS 

In this section we describe the analysis of 10 samples of 
jets generated according to the prescription of the previ- 
ous sections. The first sample contains QCD jets derived 
from QCD dijet events. The second sample consists of 
jets from /i — 77 events where each jet typically contains 
one of the photons from the Higgs decays, plus contribu- 
tions from the rest of the event (ISR, FSR, UE). We refer 
to the jets in this sample as single photon jets, or simply 
single photons. The remaining 8 samples of jets are the 
photon-jet samples and correspond to the 8 study points 
in Table|l] As noted above, in these events the Higgs par- 
ticle decays into 4 or 8 photons and the corresponding 
photon-jets typically contain either 2 or 4 photons. The 
resulting distributions for QCD-jets (red), photon-jets 
(blue) (PJSP 8) and single photons (green) are indicated 
in Fig.[l] 

0.035 r 1 

0.030 '\ 



§ 0.020: J V\ 
■g 0.015- // 

0.010 :Jf ^'^m-^ 
0.005 : ^^^^^^-^^^.^^ 
50 60 70 80 90 100 
Pt (GeV) 

FIG. 1. The Pt distribution of jets for QCD-jets (red), 
single photons (green), and for photon-jets (blue) from the 
study point PJSP 8. Jets are constructed as described in the 
text (the anti-AiT algorithm with R = 0.4). 



The specific values are chosen to mimic the choices for real 
detectors and the difference between the two accounts for the 
differing noise levels in calorimeter cells of different sizes. 



As expected, the px distribution for QCD-jets is a 
falling distribution, while both the single photon and 
photon-jet distributions exhibit a peak near to^/2(= 
60 GeV). We understand this last point as arising from 
the production of Higgs particles with zero transverse 
momentum followed by 2-body decays (either 2 photons 
or 2 Til's). It is the remnants of these two bodies that 
are typically captured in the jets yielding the indicated 
peaks near px ~ to/i/2. For the photon-jet sample we 
only show the pr spectrum for the study point PJSP 8, 
but note that the px distributions are almost identical for 
all other benchmark points. As indicated in Fig. [T] the 
jets in all of these samples of events have crudely com- 
parable transverse momentum distributions in the range 
50—100 GeV, although the QCD sample is more strongly 
peaked at the low end. Thus studying the jets in these 
samples should provide a useful laboratory in which to 
study photon-jets, QCD jets and single photons. 

The remainder of this section describes a systematic 
analysis aimed at distinguishing photon-jets from QCD- 
jets as well as from single photons. We begin with brief 
descriptions of the variables that provide the discriminat- 
ing power. The variables are organized into two groups: 
(i) conventional variables and (ii) substructure variables. 
We demonstrate how each of these variables individually 
discriminates photons-jets form the jet samples. Later in 
this section, we combine these variables in a multivariate 
analysis in order to maximize the separation of photon- 
jets from QCD-jets as well as from single photons. 



A. Conventional Variables 

The conventional variables we describe below are well 
known, well understood, and play essential roles in the 
identification of single photons, i.e., the separation from 
QCD-jets. We expect these variables to play a similar 
role in separating photon-jets from QCD-jets, since the 
probability distributions as functions of these variables 
are similar for photon-jets and for single photons. On 
the other hand, they cannot be expected to efficiently 
discriminate photon-jets from single photons. 



1. Hadronic Energy Fraction, 9j 

We define the hadronic energy fraction 9j for a jet to 
be the fraction of its energy deposited in the hadronic 
calorimeter: 

Oj = ^ E (3) 

'' iGHCale J 

where Ej is the total energy of the jet, and Ei is the 
energy of the i-th HCal cell that is a constituent of the 
jet. This is the most powerful variable for discriminat- 
ing a single photon or a photon-jet (objects that deposit 
most of their energy in the ECal) from QCD-jets. Since 
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a QCD-jet typically contains 2/3 charged pions and 1/3 
neutral pions, we expect to see a peak at 9j ~ 2/3 
(log 6',/ ^ —0.2) for QCD-jets. Isolated single photons 
and photon-jets, on the other hand, should exhibit very 
small 9 J values. However, we start with objects identi- 
fied by a jet algorithm so there will be contributions from 
the rest of the event and pile-up, and from leakage from 
the ECal into the HCal. Thus the precise value of Oj for 
single photons and photon-jets will depend on detailed 
detector properties and on the contribution from the un- 
derlying event and pile-up. Nevertheless, we expect sin- 
gle photons/photon-jets to exhibit very small values of 
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FIG. 2. The probability distributions for jets as functions of 
\og9j for QCD-jets (red), single photons (green) and photon- 
jets from PJSP 8 (blue). The first bin of the plot (at Oj = 
10"^) has an open lower boundary, i.e., it includes all jets 
with logOj < -3.0. 



Figure [2] shows the probability distribution versus 
logOj for QCD-jets (red), single photons (green), and for 
photon-jets (blue) in our simulated data. For the photon- 
jets we only show the study point PJSP 8, since the dis- 
tribution is essentially identical for the other benchmark 
points. As expected the QCD-jet distribution peaks near 
\og6j = —0.2 {0j = 2/3), while the single photon and 
photon-jet distributions are very similar with a peak near 
log 6 J = —1.9 and an implied tail to very small 9j values. 
The clear separation of the single photon/photon-jet dis- 
tributions from the QCD-jet distribution indicates why 
this variable plays such an important role in the separa- 
tion of QCD-jets from photons. 

Any reasonable cut on 9j {9j ^ 0.1) will reduce the 
QCD-jet contribution by factors of lO^^-lO^'^, while 
barely changing the photon/photon-jet contribution. We 
impose a preliminary cut by keeping only dj < 0.25 
(log 6*7 < -0.6). About 2% of the original QCD-jets 
survive this cut, while approximately 94% of the single 
photons/photon-jets survive. We use the modified jet 
samples that pass this preliminary 9j cut for the remain- 
der of this paper. 



2. Number of Charged Tracks, vj 

In conventional collider phenomenology, the number 
of charged particles (tracks) associated with an object 
is often used to distinguish objects from each other. Al- 
though photons and electrons generate similar signatures 
in the ECal, the latter are typically associated with a 
track while the former are not. Tracks also play an im- 
portant role in rejecting QCD-jets since, as mentioned 
before, a QCD-jet typically contains several charged pi- 
ons. 

In our simulated data we keep all charged particles 
with > 2 GeV and assume that all of these corre- 
spond to tracks in a real detector. In order to associate 
these tracks with the jets, which are constructed entirely 
from calorimeter cells, we perform the following analysis. 
First replace each track by an arbitrarily soft light-like 
four vector with the same {rj-cj)) direction as the track, 
and then include these soft four-vectors in the jet clus- 
tering process along with the calorimeter cells. (We ex- 
plicitly check that the inclusion of these soft four-vectors 
does not affect the outcome of the clustering procedure.) 
A track is associated with a jet if the soft four vector 
corresponding to that track is clustered into that jet |^ 
The resulting total number of tracks associated with a 
jet yields the value of vj for that jet. Figure |3] shows 
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FIG. 3. The relative probability distribution for QCD jets 
(red), single photons (green) and photon-jets (blue) versus 
the number of charged tracks associated with a jet. The algo- 
rithm for associating tracks with jets is given in the text. For 
photon-jets we show the distribution for jets from the study 
points PJSP 1 (dotted) and PJSP 8 (sohd). 

the relative probability distribution versus the number 



^ As a check we also consider the more traditional construction 
where a track is associated with a jet if it is within an angular 
distance R or less from the given jet's direction, where R is the 
size-parameter used in the clustering algorithm. For anti-fcy jets 
both methods yield identical associations of tracks and jets. For 
the kj- or C/A algorithms, where jets are not exactly circular, 
the method described in the text is a more natural definition of 
whether a track is associated with a jet or not. 
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of tracks per jet (vj) for QCD-jets (red), single photons 
(green) and photon-jets (blue). As expected, the number 
of tracks associated with QCD-jets varies over a broad 
range and only a tiny fraction of QCD-jets have no as- 
sociated tracks. The single photon/photon-jet samples, 
on the other hand, are dominated by jets with no asso- 
ciated tracks. Photons that convert yield tracks associ- 
ated with the corresponding jets. Since the probability 
of conversion increases with the number of photons per 
jet, the probability of obtaining one of more associated 
tracks increases from single photon jets (single photons) 
to jets with two photons (typical for PJSP 1, the dotted 
blue curve) to jets with four photons (typical for PJSP 8, 
the solid blue curve). As with the variable 9j, vj offers 
some separation between QCD-jets and single photons, 
but much less between single photons and photon-jets 
(and even less between the different types of photon-jets). 



B. Jet Substructure 

Next we want to focus on variables that explicitly char- 
acterize the internal structure of jets, i.e., characterize 
the energetic subjet components of the jet. Recall that in 
this analysis we have identified jets using the the anti-fc^ 
jet algorithm with R = 0.4, but we do not expect the 
general features of our analysis to depend on this spe- 
cific choice. The next step is to determine a 'recombina- 
tion tree' for the jets we want to study (here the leading 
jet in each event). To this end we apply the fc^ algo- 
rithm [m 133] to the calorimeter cells identified as con- 
stituents of the jet in the first step. (We could as well 
use the Cambridge/ Aachen (C/A) algorithm [44ti46] . but 
not the anti-fc^ algorithm in this step as anti-fc^ does 
not tend to produce a physically relevant recombination 
tree.) This recombination tree specifies the subjets at 
each level of recombination N from iV = 1 (the full jet) 
to N = the number of constituent calorimeter cells in the 
jet (no recombination). At the next step the subjet vari- 
ables we study fall into two classes. In the first class we 
attempt to count the effective number of relevant sub- 
jets without using any properties of the subjets in the 
tree except their directions in rj-cj). In this case the useful 
variable (defined in detail below) is called 7V-subjettiness. 
The A^-subjettiness variable for a given jet becomes nu- 
merically small when the parameter N is large enough to 
describe all of the relevant substructure, i.e., this value of 
iV provides a measure of the number of subjets without 
explicitly identifying the subjets. A'^-subjettiness involves 
all components of the original jet for all values of N. 

The rest of the substructure variables we study more 
explicitly resolve a jet into a set of subjets. We de- 
fine both the level in the recombination tree at which 
we choose to work, i.e., the number of subjets we have 
split the jet into and how many of these subjets to use 
in the subsequent analysis. We use A^pre-fiiter (this no- 
tation should become clear shortly) and Ahard to label 
these two parameters. Thus we start with the 4-vectors 



corresponding to the (calorimeter cell) constituents of a 
given jet, and then (re)cluster these constituents using 
the chosen subjet algorithm (which is not necessarily the 
algorithm used to originally identify the jet) in exclusive 
mode, i.e. we continue (re)clustering until there are pre- 
cisely A'prc-fiitor 4-vectors left ~ the ATprc-fiitcr exclusive 
subjets. Out of these A^prc-fiitcr subjets we pick the A'hard 
largest pr subjets and discard the rest. All the substruc- 
ture variables discussed below (except A-subjettiness) 
are constructed using these A^hard subjets. Note that by 
choosing Apio.fiitcr > A'haidj we have performed a version 
of jet 'grooming' typically labeled filtering [101 1131 IS]- 
This will ensure that our results are relatively insensitive 
to the effects of the underlying event and pile-up. Ideally, 
the integers (A'hardi A^pro-fiitcr) should be chosen based on 
the topology of the object we are looking for. However, 
the naive topology will be influenced by the interaction 
with the detector and the details of the jet clustering al- 
gorithm. For example, a 4 photon photon-jet will often 
appear in the detector to have fewer than 4 distinct lobes 
of energy, i.e., one or more photons often merge inside a 
single lobe of energy. In our simulation, we find that the 
choice A^hard = 3 and A^prc-fiiter = 5 is an acceptable com- 
promise, working reasonably well for single photons and 
photon-jets from all the study points. Further optimiza- 
tion will be possible in the context of real detectors and 
searches for specific photon-jet scenarios. 

1. N -Subjettiness, tn 

"A^-subjettiness" , introduced in Ref. [47l|48], is a mod- 
ified version of "A^-jettiness" from Ref. It is adapted 
in a way such that it becomes a property of a jet rather 
than of an event. A^-subjettiness provides a simple way 
to effectively count the number of subjets inside a given 
jet. It captures whether the energy flow inside a jet de- 
viates from the one-lobe configuration expected to char- 
acterize a typical QCD-jet. We use the definition of A^- 
subjettiness proposed in Ref. [57]. The starting point 
is a jet, the full set of 4-vectors corresponding to the 
(calorimeter cell) constituents of the jet (here found with 
the anti-ZcT algorithm for R = 0.4), and the recombina- 
tion tree found with the kx algorithm as outlined above. 
From this tree we know the 4-vectors describing the ex- 
clusive subjets for any level N, i.e., the level where there 
are exactly A^ subjets. With this information we can 
define A^-subjettiness to be 

J2kPTk X min{Ai?i.fc,Ai?i,fe,--- ,ARN,k} 
'^N = 5 , (4) 

where k runs over all the (calorimeter cell) constituents 
of the jet, pt^. is the transverse momentum for the k-th 

constituent, ARi^k = \/{^'ni,kY + {.^4>i,kY is the angu- 
lar distance between the l-ih subjet (at the level when 
there are A^ subjets) and the fc-th constituent of the jet, 
and R is the characteristic jet radius used in the original 
jet clustering algorithm. 



log ri Ts/t-i T3IT2 u/T2 

FIG. 4. Probability distributions vesus various TV-sub jettiness variables. The solid red and green curves show, as usual, the 
distributions for QCD-jets and single photons respectively. Various blue curves are for photon-jets from different study points. 
The solid, dashed, dotted and dash-dotted curves in all these figures are for PJSP 8, PJSP 4, PJSP 1 and PJSP 3 respectively. 



In the context of single photons, photon-jets and QCD- 
jets, we use iV-subjettiness in two different ways. The 
first application is to use the ratios tn+i/tn in the 
same way iV-subjettiness is used to tag boosted mas- 
sive particles such as a,W boson or a hadronic decaying 
top [47l SS]. In particular, for a jet with iVg distinct 
lobes of energy, tjvq is expected to be much smaller than 
tNo-i (of course, we are assuming iVo > 1), whereas for 
N > Nq, Tjv+i is expected to be comparable to tn. Thus 
a two photon photon-jet is expected to be characterized 
by T2/T1 ^ 1. On the other hand, one lobed QCD-jets 
and single photons should exhibit comparable values for 
T2 and Ti, and consequently T2/T1 ^ 1. 

The second way in which we use A^-subjettiness con- 
sists of using the magnitude of ti itself. Even for a jet 
with one lobe of energy the exact magnitude of ri rep- 
resents a measure of how widely the energy is spread. A 
pencil-like energy profile, like that of a single photon or a 
narrow photon-jet, should yield a much smaller ti com- 
pared to QCD-jets with a much broader profile. In fact, 
Ti is an indicator of jet mass, and, for jets with identical 
energy, ti is proportional to the square of the jet mass. 

Figure |4] shows the probability distributions versus 
logTi and Tjv-i-i/TAr for A'^ = 1,2,3 corresponding to 
single photons, QCD-jets and photon-jets from different 
study points. Note that for photon-jets, the jet mass is 
almost always given by the mass parameter mi in Ta- 
ble |T| Thus for PJSP 8 and PJSP 3, where Wi has the 
same value, the probability distributions versus logTi are 
almost identical. For study points PJSP 8, PJSP 4 and 
PJSP 1 the peak in log ti shifts to the left as the value of 
7711 decreases (from 10 GeV to 2 GeV to 0.5 GeV). Note 
also that the PJSP 1 and PJSP 3 distributions exhibit 
a small ti (small mass) enhancement at essentially the 
same ri value as the primary peak in the single photon 
(green curve) distribution. This presumably corresponds 
to those kinematic configurations where only one of the 
(two) photons from the rii decay is included in the jet. 
Thus we expect that a (small) fraction of the time these 
scenarios will look very single photon-like. 

Clearly the ratio T2/T1 gives significant separation for 
the different photons-jet scenarios. The study points 
PJSP 8 and PJSP 3 are now separated, although both 



exhibit peaks at small values of the ratio. This suggests 
an intrinsic 2-lobe structure corresponding to 2 photons 
for PJSP 3 and 4 photons in two relatively tight pairs 
(777,2 < "^i) for PJSP 8. PJSP 4 with presumably a 
more distinctive 4 photon structure exhibits a broader 
peak at a larger value of t^Itx. Single photons and 
PJSP 1 exhibit even broader distributions presumably 
corresponding to an intrinsically 1-lobe structure. The 
QCD-jet distribution is also broad but with an enhance- 
ment around T2/T1 = 0.8, presumably arising from a typ- 
ical 1-lobe structure but some contribution from showers 
with more structure and from the underlying event. The 
ratios T3/T2 or t^Jt-^ seem to be less effective in discrim- 
inating photon-jets from single photons and QCD-jets. 
This can be understood by noting that quite often the 
hard photons inside a photon-jet become coUinear at the 
scale of the size of the cell. So even for photon-jets with 
4 hard photons, we rarely find jets with 4 distinct cen- 
ters of energy. In general we expect the ratio tjv+i/tat 
becomes less and less useful with increasing N . 

Note that the distributions for single photons and 
photon-like photon-jets tend to exhibit a double peak 
structure in T3/T2 or Ti/r^i. We believe that this feature 
arises from both the contributions due to the underly- 
ing event and due to our implementation of transverse 
smearing in the ECal (see Appendix A3). 



2. Transverse momentum of the Leading Subjet 

Now we proceed to discuss the second class of subjet 
variables constructed from the 3 hardest subjets out of 
the 5 exclusive subjets. As the first such variable consider 
the fraction of the jet transverse momentum carried by 
the leading subjet, which provides significant information 
about the jet itself. In particular, it indicates the fraction 
of the jet's total pr carried by the leading subjet only. 
Since photon-jets result from the decay of massive parti- 
cles into hard and often widely separated photons inside 
the jet, the subjets are usually of comparable hardness. 
The leading subjet for single photons and for QCD-jets, 
on the other hand, typically carry nearly the entire pt of 
the jet. So for the majority of these jets, the pt of the 
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leading subjet (label it pr^) is of the order of the pr of 
the entire jet (ptj)- Instead of using the ratio PTl/ptj 
directly we find that it is more instructive to define the 
variable 



A., 



log(l 



PTj 



(5) 



The advantage of using the definition in Eq.(|5| is that it 
focuses on the behavior near pxj^ ^ pTj ■ 

The discussion above depends crucially on how the 
subjets are constructed, especially for QCD-jets. QCD 
partons typically shower into many soft partons/hadrons. 
After showering and hadronization, single hard partons 
yield many soft hadrons distributed throughout the jet. 
The way in which these jets are clustered into subjets 
dictates the pt distribution of subjets. For example, 
for anti-/cr subjets, the hardest subjet will always have 
Pt^ ~ PTj- The kx algorithm, on the other hand, clus- 
ters the softer elements first and results in more evenly 
distributed subjets. The C/A jet algorithm clusters tak- 
ing into consideration only the geometric separations of 
the elements, and produces qualitatively different results. 
Single photons, on the other hand, shower very little (no 
QCD Shower) and deposit energy in only a handful of 
cells (per hard photon). Therefore we expect that our 
results for single photons or photon-jets will be less sen- 
sitive to the details of the clustering algorithm. To versify 
this point we use both kr and C/A subjets to evaluate 
A J from Eq.([5|. The simultaneous use of different clus- 
tering algorithms to extract information from the same 
jet should not come as a surprise. As shown in Ref. [50] . 
substantial further information can be extracted if one 
employs a broad sampling out of all of the physically 
sensible clustering histories (trees) for a given jet. In 
this sense the current analysis is modest in that we only 
use two specific clustering procedures. 
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FIG. 5. Probability distribution for Xj from Eq.([5|. As 
in Fig. |4] the solid red is for QCD-jets, the solid green for 
single photons, dotted blue for PJSP 1, dash-dotted blue for 
PJSP3, dashed blue for PJSP4 and solid blue for PJSP8. The 
left (right) figure shows the distribution when A is calculated 
using C/A (kr) subjets. 

In Fig. [5] we plot the probability distribution of jets 
as a function of Aj for QCD-jets, single photons, and 
photon-jets. The left (right) panel shows the distribu- 
tion when we use the C/A (kx) algorithm to find the 
subjets. Note how the distribution for QCD-jets (the red 



curve) moves more to the right (i.e., the px of the jet 
gets more evenly distributed among its subjets) as we go 
from C/A subjets to kx subjets. The various photon-jet 
study points also look more similar when using the kx 
algorithm. In this case the PJSP 1 and PJSP 3 distri- 
butions exhibit enhancements suggesting the presence of 
both single photon-like behavior (Aj ~ —1-2) and QCD- 
like behavior (Aj ~ —0.2 to —0.3). The more complex 
structure of the PJSP 4 and PJSP 8 jets exhibit a distri- 
bution closer to QCD alone. Finally note that the C/A 
subjets display the jet substructure information differ- 
ently from the kx case with the peak in the QCD-jet 
distribution at least somewhat separated from the peaks 
in the photon-jet distributions. Also for C/A all of the 
photon-jet scenarios exhibit at least a little single photon- 
like enhancement (for kx this is only true for PJSP 1 
and PJSP 3). There is clearly some discrimination to be 
gained from using more than one definition of the subjets. 



3. Energy-Energy Correlation, ej 

Another useful variable is the "energy-energy correla- 
tion". We define it as: 



(6) 



(l>j)eA'hard 



where Ej is the total energy of a given jet, and the indices 
i,j run over the (3 hardest) subjets of the jet. From the 
definition, it should be clear that ej is sensitive to the 
energy of the subleading jets. In particular, the energy- 
energy correlation can be expressed as 



~ ^5 

^ El {E j — E l )-^ E m l E m n l 
' 



(7) 



where E^, Enl, and En^l are the energies of the leading 
subjet, the next-to-leading subjet, and the next-to-next- 
to-leading subjet. 




FIG. 6. ProbabiHty distribution versus ej from Eq.([6]|. As 
in Fig. [4] the solid red is for QCD-jets, the solid green for 
single photons, dotted blue for PJSP 1, dash-dotted blue for 
PJSP3, dashed blue for PJSP4 and solid blue for PJSP8. The 
left (right) figure shows the distribution when ej is evaluated 
using C/A {kx) subjets. 
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We show the probabiUty distribution of jets as a func- 
tion of e,/ for QCD-jets, single photons and photon- 
jets in Fig. [6] Note that for single photons (the green 
curve), EffL and E^^l are negligible and hence we ex- 
pect e,7 for single photons to be well approximated by 
El {Ej — El) /E'\. In fact, the sharp peak for single 
photons in Fig. [5] at —1.2 (fcy algorithm) corresponds 
to the sharp peak at about 0.04 in Fig. |6] More gen- 
erally the qualitative features in Fig. [5] are repeated in 
Fig. [6j For C/A subjets the distributions for all of the 
photon-jet study points exhibit two peaks, the large ej 
value enhancement presumably corresponding to the en- 
ergy being shared approximately equally among several 
final photons, while the small value enhancement arises 
from the case when one photon dominates (perhaps be- 
cause some of the photons are not in the jet). For fc^ 
subjets only the PJSP 1 and PJSP 3 distributions ex- 
hibit the small ej single photon-like enhancement. We 
also see that again the two algorithms yield distinctly 
different distributions for QCD-jets. 

4- Subjet Spread, p,j 

We define "subjet spread" as a measure of the geomet- 
ric distribution of the subjets. 

PJ^]^ E ^^M' (8) 

(■J>j)eA'hard 

where ^Ri,j is the angular distance between the «-th and 
j-th (hard) subjets, and R is the size parameter of the 
jet algorithm. 




FIG. 7. Probability distribution for subjet-spread pj from 
Eq.(|8|. As in Fig. [i] the solid red is for QCD-jets, the solid 
green for single photons, dotted blue for PJSP 1, dash-dotted 
blue for PJSP 3, dashed blue for PJSP 4 and solid blue for 
PJSP 8. The left (right) figure shows the distribution when 
pj is calculated using C/A {kr) subjets. 

The left (right) panel of Fig. [t] shows the probability 
distribution of jets as a function of pj for QCD-jets, sin- 
gle photons and photon-jets when the C/A (fc^-) subjets 
are used to evaluate Eq.Q. For this variable only the 
QCD-jet distribution changes dramatically when chang- 
ing the choice of subjet algorithm from C/A to kx- By 
using both algorithms this feature will provide some abil- 
ity to discriminate between QCD-jets and single photons 



or photon-jets. For the single photon case the the strong 
peak at small pj confirms that all of the subjets are close 
to each other, forming a hard core. Subjet spread is quite 
sensitive to the mass mi as can be seen from the differ- 
ent photon-jet distributions. In particular, the position 
of the peaks for photon-jets with different mi simply fol- 
low the mi value. The PJSP 3 and PJSP 8 distributions 
are nearly the same (with the same mi value), while the 
PJSP 1 and PJSP 4 distributions are just similar (with 
somewhat different mi values), but distinct from PJSP 3 
and PJSP 8. The mi dependence is not surprising since 
the opening angle between the decay products of the rii 
particle depends on mi. Finally we note that the PJSP 3 
and PJSP 8 distributions do have an enhancement at 
small Pj values presumably corresponding to configura- 
tions where the extra photons are not captured in the 
jet. 

5. Subjet Area of the Jet 

As defined in Ref. [5T], the "area" associated with a 
jet is an unambiguous concept that represents quantita- 
tively the amount of surface in the {ri-4>) plane included 
in a jet. In this analysis, we use the "active area" defi- 
nition for the area of the jet. The active area of a jet is 
calculated by adding a uniform background of arbitrar- 
ily soft 'ghost' particles to the event (so that each ghost 
represents a fixed area) and then counting the number of 
ghosts clustered into the given jet. The area of a jet is 
often used to provide a quantitative understanding of the 
largely uncorrelated contributions to a jet from the un- 
derlying event and pile-up. However, it is rarely used in 
phenomenology for the purpose of discovering new parti- 
cles or tagging jets. We use 'subjet area' as a measure of 
the 'cleanliness' of the jet. We show that it can be a use- 
ful tool for distinguishing a single photon or a photon-jet 
from noisier QCD-jets. We define the subjet area fraction 
as 

= ^ E (9) 

where Ai is the area of the i-th subjet and Aj is the area 
of the entire jet. Note that this definition of 5j is only 
useful when the subjets are constructed geometrically by 
merging the nearest neighbors first (i.e., using the C/A 
algorithm). In Fig. [sj we show the probability distri- 
bution for jets as a function of 5j for QCD-jets, single 
photons, and photon-jets. As expected, the figure shows 
that single photons (the green curve) are significantly 
cleaner (exhibit smaller 5j values) than QCD-jets (the 
red curve) and that photon-jets (the blue curves) tend to 
lie in between. Fixing mi such that the first splitting is 
fairly wide, we can investigate the effects of m2. If m2 is 
small, then the two photons coming from the n2 decays 
will be very close together, and the subjet that contains 
them will not collect many ghosts. On the other hand, 
a large m2 will split the two photons further apart and. 
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FIG. 8. Probability distribution versus fractional area 5j 
from Eq.Q. As in Fig. [i] the solid red is for QCD-jets, the 
solid green for single photons, dotted blue for PJSP 1, dash- 
dotted blue for PJSP 3, dashed blue for PJSP 4 and solid blue 
for PJSP 8. We use C/A subjets to calculate Sj. 



if still contained in the same subjet, that subjet wih col- 
lect substantially more ghosts resulting in a subjet with a 
larger active area. QCD-jets contain many soft particles 
and so the subjets in QCD jets have larger areas. Thus 
we see that the QCD distribution peaks for Sj near 0.5, 
while the single photon distribution exhibits both a large 
peak at small (~ 10^^) 5j and a smaller peak at larger 
0.4) 5j values. The photon-jet cases interpolate be- 
tween these two behaviors and this variable can clearly 
provide some discriminating power. 



C. Multivariate Analysis 

We have, so far, introduced a set of well-understood 
variables. In this subsection, we will employ these vari- 
ables in a multivariate discriminant, specifically in a 
Boosted Decision Tree (BDT) ^S]- ^ decision tree is 
a hierarchical set of one-sided cuts used to discriminate 
signal versus background. The 'boosting' of a decision 
tree extends this concept from one tree to several trees 
which form a forest. The trees are derived from the same 
training ensemble by reweighing events, and are finally 
combined into a single classifier. 

In the current discussion we are treating photon-jets 
as the signal and both single photons and QCD-jets as 
background. We construct multiple BDT analyses in or- 
der to estimate how well the photon-jets can be separated 
from single photons and from QCD-jets. This will allow 
us to demonstrate the power of the new jet substruc- 
ture variables when these are combined with the conven- 
tional variables. In practice, we employ the Toolkit for 
Multivariate Analysis (TMVA) 53 package and use the 
"BDTD" option to book BDTs, where the input variables 
are decorrelated first. 

For every study point in Table |l] we optimize two 
separate BDTs, one for discriminating photon-jets form 
QCD-jets and the other for separating photon-jets from 



single photons. We make use of all the variables dis- 
cussed earlier in order to minimize the background fake 
rate (J^ — the fraction of the background jets that pass 
the cuts) for a given signal acceptance rate {A = the frac- 
tion of the signal jets that pass the cuts). For demonstra- 
tion purposes we also consider BDTs made with a subset 
of the full set of variables. To be specific, we consider 
three different sets of variables: 



D = |log 6*7,1^7, log Ti, 



T2 73 T4 

; 7 7 

Tl T2 Tg 

(Aj, ej, pj, Sj) (Aj, ej, pj) \ 



Dc. = 



D. = 



|log6ij,i/j| 

r, T2 T3 T4 

j logri, — , — , — 

I- Tl T2 T3 

(Aj,ej,/0j,(5,/)| 



C/A 



, (Aj,ej,p,7)|^^} , 



(10) 



(11) 



(12) 



where the subscripts C/A or kr in Eqs. ( 10 ) and ( 12 ) im- 
ply that the observables are calculated using C/A~or 
subjets. The sets Dc and Dg consist of the conventional 
and the jet substructure variables respectively, whereas 
D is the set of all variables. 

In a previous paper [? ] we described the more conven- 
tional separation of single photons from QCD-jets along 
with an initial introduction to the separation of single 
photons from photon-jets. In both cases the single pho- 
tons were treated as the signal. Here we extend that 
discussion and focus on the photon-jets as the signal. 
We organize the results of our analysis into three sub- 
sections. First, we show the results of BDTs optimized 
to discriminate photon-jets from QCD-jets, the analogue 
of the seperation of single photons from QCD-jets. In 
the following subsection, we repeat the same study, but 
optimize it for treating single photons as the background 
to photon-jets. Finally, we demonstrate how the BDTs 
might be used for an effective three-way separation of 
single photons from photon-jets from QCD-jets. 



D. QCD-Jets as Background for Photon-jets 

We use all of the variables in the set of discriminants D 
in the BDTs in order to maximize the extraction of signal 
jets (photon-jets) from background (QCD-jets). This is 
similar to the separation of single photons from QCD-jets 
perfromed in Ref. [? ]. The BDTs are trained individu- 
ally for each study point. The results for fake rate versus 
acceptance are shown in Fig.[9]for all of the study points. 
In this plot the lower right is desirable and the upper left 
is undesirable. Note that the acceptance rate for photon- 
jets is bounded above by about 0.94 due to our preselec- 



tion cut 9 J > 0.25 (see Section IVB4|. The same cut 



eliminates approximately 98% of the QCD-jets yielding 
a fake rate below 10~^ except at the largest acceptance. 

For 2 photon photon-jets (study points 1 to 3) the sep- 
aration becomes easier as toi increases yielding increas- 
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FIG. 9. The background fake rate versus signal acceptance 
where photon-jets from the different study points are the sig- 
nal and QCD-jets are the background. All variables in the set 
of discriminant D are used in the analysis. 



ing separation between the photons inside the jet. The 
other physics scenarios tend to have even more structure 
within the photon-jets that the jet substructure variables 
allow us to use to suppress the QCD background. The 
more structure a jet possesses, the easier it becomes to 
discriminate it from (largely feature-less) QCD-jets. The 
conclusion from Fig. [9] is that, for photon-jets of varied 
kinematic features, we can achieve a very small QCD 
fake rate for a reasonably large acceptance rate. In more 
detail, for all of our study points a tagging efficiency (ac- 
ceptance) of ^ 70% for photon-jets is accompanied by a 
fake rate for QCD-jets of only 1 in 10'' to 1 in 10^. 
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FIG. 10. The improvement brought in because of the use of 
substructure variables are shown in the figure. For a quanti- 
tative definition of improvement see the text. 

It is instructive to quantify the improvements made 
possible by including the jet substructure variables as 
discriminants. To achieve this comparison we consider 
BDTs using only the conventional variables (i.e., we use 
the set Dq of discriminants to train the BDTs). For 
a given acceptance of signal we thus obtain two differ- 
ent fake rates - one when we use only the conventional 



variables (labeled Jx:), and another when we use conven- 
tional -|-jet substructure variables (labeled J-q-i-s)- For a 
given acceptance, the ratio Tc/J^c+S quantifies the im- 
provement due to using jet substructure variables in this 
analysis. The improvement rates for conventional plus 
jet substructure variables over only conventional vari- 
ables versus acceptance for discriminating photon-jets 
from QCD-jets is shown in Fig.[TO]for the different study 
points. While Figs.[2]and[3]indicate that the conventional 
variables provide some discrimination between photon- 
jets and QCD-jets, Figs. |4] to |8] indicate that the jet sub- 
structure variables provide a substantial number of new 
distinguishing features. Fig.[lO]shows that these new fea- 
tures in the jet substructure variables can provide sub- 
stantial improvement. Factors of 4 to 50 improvement 
in the discrimination of photon-jets from QCD-jets are 
possible at an acceptance of about 70%. As expected 
more improvement is possible in those physics scenarios 
where the photon-jets have more structure. Further, our 
results demonstrate that the use of jet substructure vari- 
ables provides a tool to distinguish the different physics 
scenarios, i.e., the different study points, which is not 
possible with conventional variables alone. 



E. Single Photons as Background to Photon-Jets 

Now consider the same analysis as in the previous sec- 
tion but with single photons treated as the background. 
This new sort of separation is essential if we want to 
consider physics scenarios with photon-jets. Again we 
use all of the variables in the set of discriminants D in 
the BDTs in order to maximize the extraction of sig- 
nal jets (photon-jets) from background (single photons). 
The BDTs are trained individually for each study point. 
The results for fake rate versus acceptance are shown in 
Fig. [Tl] As in Fig. |9] the lower right is desirable and the 
upper left is undesirable. Again the acceptance rate for 
photon-jets is bounded above by about 0.94 due to our 
preselection cut 0j > 0.25 (see Section IV B 4). For the 



same reason a similar limit (0.94) holds also for the fake 
rate from single photons (although this is difficult to see 
on the logarithmic scale). 

The results in Fig. [Tl] teach us several lessons. A 
photon-jet from PJSP 1 consists of a pair of highly 
collinear photons. Such a jet is quite photon-like and 
thus difficult to separate from single photons. Hence the 
corresponding (solid black) curve is most towards the up- 
per left. One needs to cut away almost half of the signal 
sample {A ~ 0.55) in order to reduce the fake rate to 
1 in 10'^. We also see that it is a challenge to separate 
the photon-jets for PJSP 3 from single photons (the solid 
red curve). In this scenario mi — 10 GeV and the ni 
decays directly to two photons. Because of the large mi 
value, almost 30% of these {R = 0, 4) jets do not contain 
both of the photons from the ni decay, i.e., about 30% 
of this jet sample are actually single photons (in the jet), 
and not photon-jets. We saw this point earlier in essen- 
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FIG. 11. The background fake rate versus signal acceptance 
curves are shown for all study points. Here the photon-jets 
from the different study points are treated as the signal and 
single photons are the background. These curves employ all 
variables in the set of discriminants D. 



tially all of the individual jet substructure variable plots, 
Figs. |4] to [8) where the PJSP 3 distribution exhibited 
an enhancement that overlapped with the corresponding 
peak in the single photon distribution. A larger separa- 
tion of PJSP 3 from single photons can be obtained at 
an acceptance just below 0.7, where these single photons 
configurations are cut away and the fake rate drops below 
1 in 10"^. The photon-jets of PJSP 2 represent a 'sweet' 
spot between PJSPl and PJSP3 where the 2 photons are 
typically well enough separated to be resolved but close 
enough to be in the same jet. Thus the PJSP 2 (solid 
purple) curve is well below and to the right compared to 
the PJSP 1 (sohd black) and PJSP 3 (sohd red) curves. 
Similarly the photon-jets at the other study points can 
be well separated at even larger acceptance rates using 
the combination of jet substructure and conventional dis- 
criminants. For example, for the study points PJSP4 and 
PJSP 6, even at 85% acceptance, one obtains a fake rate 
smaller than 1 in 10'^. 

Again it is instructive to determine the impact of the 
jet substructure variables for this analysis. As in the pre- 
vious subsection we consider BDTs using only the con- 
ventional variables (i.e., we use the set Dq of discrimi- 
nants to train the BDTs) to compare to the results from 
the full set D of variables. We plot the ratio of fake rates 

ver- 



at fixed acceptance for these two analyses in Fig. 12 
sus the acceptance. A comparison of Fig. [T2| and Fig. [Tl] 

indicates that the bulk of the separation of photon-jets 
from single photons is provided by the jet substructure 
variables, i.e., the improvement factor typically differs by 
less than a factor of 10 from one over the fake rate. Fur- 
ther, the improvement factor ranges from 10 to more that 
10^ even at acceptances as large as 90% for all physics 
scenarios except PJSP 1 and PJSP 3. Even in these chal- 
lenging cases substantial improvement is possible at lower 
acceptance rates. This is not a surprise since the con- 
ventional variables are ineffective at distinguishing be- 
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FIG. 12. The improvement brought in because of the use of 
substructure variables are shown in the figure. For a quanti- 
tative definition of improvement see the text. 



tween photon-jets and single photons. Recall from Fig. [2] 
that the hadronic energy fraction distributions are nearly 
identical for photon-jets and single photons. The distri- 
bution of the number of charge tracks associated with a 
jet, shown in Fig. [3} also indicates only slight differences, 
arising from the somewhat different conversion rates for 
photon-jets versus single photons. So it is clear that, if we 
want to be able to discriminate between photon-jets and 
single photons (and we do), the jet substructure variables 
provide the necessary tool. 



F. Three-way Separation 

Finally we come to the really interesting challenge: 
the simultaneous separation all three samples: single 
photons, photon-jets, and QCD-jets. In principle, one 
could perform three BDT training exercises, separating 
photon-jets from single photons, separating photon-jets 
from QCD-jets and separating single photons from QCD- 
jets, using one of the variable sets of Eqs. [lO] - [12] in 
each case. Then the responses from each of these BDTs 
for each jet could be used to separate the experimen- 
tally identified jets in the corresponding 3-dimensional 
'physics object' space. In order to illustrate these ideas 
in a fairly simple analysis here we will focus on a two- 
dimensional analysis employing the two BDTs we have 
been discussing, separating photon-jets from single pho- 
tons and separating photon-jets from QCD-jets. There 
are still the related questions of which set of variables to 
use for each BDT and, in fact, how to characterize the 
'best separation'!^ Qualitatively at least, we find good 2- 
dimensional separation for the following definitions of the 



^ With three BDTs and the three BDT response numbers for each 
jet, the 'best separation' presumably corresponds to the three 
distinct physics objects being sent to three diagonally opposite 
vertices of the BDT response cube (on a equilateral triangle with 
side of length y/2 times the length of the edge of the cube). 
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BDTs. One is trained to separate QCD-jets and photon- 
jets based only on the conventional discriminants (Dq) 
and is plotted on the vertical axis in the following plots, 
while the other BDT is trained to separate photon-jets 
from single photons with the substructure discriminants 
(Dg) alone and is plotted along the horizontal axis. We 
present the results in terms of two-dimensional contour 
plots where the numerical values associated with a given 
contour corresponds to the relative probability to find 
a calorimeter object of the given kind (indicated by the 
color) in a cell of size 0.1 x 0.1 in BDT response units. 
(Note that, by construction, the BDT responses have val- 
ues in the range —1 to -|-1, where -1-1 means 'signal-like' 
and —1 means 'background-like'.) The color coding in 
these figures matches the previous choices. Red is for 
QCD-jets, blue for photon-jets and green is for single 
photons. 

As a first example. Fig. [13] indicates the 2-dimensional 
distributions resulting from the BDTs for PJSP 2, a sce- 
nario with typically two photons in the photon-jet with 
small angular separation due to the small value of mi. 
When interpreting the following figures it is important 
to recall that the jet samples indicated in these figures 
are constrained to satisfy 6j < 0.25, which means that 
we are only keeping the approximately 2% of QCD-jets 
that are most 'photon- like'. However, Fig. [T3| indicates 



pletely separate (PJSP 2) photon-jets from single pho- 
tons. 
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FIG. 14. The BDT responses of QCD-jets(red), single pho- 
tons(green) and photon-jets(blue) for photon-jets at PJSP 3. 
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FIG. 13. The BDT responses of QCD-jets(red), single pho- 
tons(green) and photon-jets(blue) for photon-jets at PJSP 2. 
The Ds variables are used on the horizontal axis and the Dc 
variables on the vertical axis. 

a pretty clear separation between the QCD-jets and the 
true photon objects (little red above 0.0 in the vertical 
direction). On the other hand, as we expect from our pre- 
vious one-dimensional discussions in Subsection |IV E| the 
blue (photon-jet) contours in the upper-left green (single 
photon) region indicate that it is a challenge to com- 



In the case of PJSP 3 photon-jets, as indicated in 
Fig. [14] the photon-jet versus single photon separation 
challenge is even larger, as we have already discussed. 
Again we have photon-jets with potentially two photons 
but, due to the relatively large mi value, one of those 
photons is sometimes outside of the identified jet. This 
explains the small region with a solid blue (probability 
0.1) contour inside the green (single photon) region. 

The corresponding results for the more complex (and 
more easily separated) photon-jets of PJSP4 and PJSP8, 
typically with 4 photons in a photon-jet, are displayed 
in Figs. [15] and [16] In these scenarios the three-way 
photon-jet versus single photon versus QCD-jet separa- 
tion is fairly cleanly achieved using just the Ds (horizon- 
tal) and Dc (vertical) variable sets. At the 0.005 level 
there is only a tiny overlap of photon-jets with QCD-jets 
for PJSP 4 (near the location (0.5,0.0) in Fig. 15) and no 
overlap for PJSP 8 (Fig.[l6]). 



Before ending this section we should discuss one other 
point. From our previous discussion, one would expect 
to improve the photon-jet versus single photon separa- 
tion by using the full D set of variables (instead of the 
Ds variables alone), and this expectation raises one of 
the interesting, and challenging, features of simultaneous 
separations. Since we are currently training the BDTs so 
that each BDT separates one type of signal from one type 
of background, while, at the same time, trying to perform 
a three-way separation, it can happen that an improve- 
ment in one separation corresponds to a degradation in 
another of the separations. To illustrate this point we 



first reproduce the results in Fig. 14 but now include a 
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FIG. 15. The BDT responses of QCD-jets(red), single pho- 
tons(green) and photon-jets(blue) for photon-jets at PJSP 4. 



FIG. 17. The BDT responses of QCD-jets(red), single pho- 
tons(green) and photon-jets(blue) for photon-jets at PJSP 3 
including a contour at 0.001. 



tons is very soft or is outside of the jet) still lie within 
the single photon boundary. At the same time, however, 
the separation between single photons (green) and the 
(typically more numerous) QCD-jets (red) is somewhat 
degraded (the green and red regions have moved towards 
each other). Due to the coupling between the different 
pairwise separations, optimizing such a three-way separa- 
tion takes careful work and likely depends on the details 
of the actual analysis and detector. 

These results clearly suggest that a three-way separa- 
tion is possible, including the ability to distinguish differ- 
ent photon-jet scenarios. Further enhancement will arise 
from using the full 3-dimensional structure and from us- 
ing a realistic detector simulation in the training. A thor- 
ough optimization in the context of a real detector and 
actual data may select different, more effective choices of 
the discriminating variables. 

Photon-jet/Photon BDT (Ds) response 

FIG. 16. The separation of QCD-jets (red), single photons V. CONCLUSION 
(green) and photon-jets (blue) for photon-jets at PJSP 8. 

In this paper we have attempted to link several con- 
cepts, some conventional and some less so, with the 

contour at relative probability 0.001, which we did not goal of enhancing the searches for and the analyses of 

include earlier to avoid plots that are too busy. The re- both Standard Model and Beyond the Standard Model 

suiting plot is shown in Fig. |17| Now we perform the same physics. We advocate employing general techniques for 

analysis but using the full variable set D in both BDTs. analyzing and interpreting the detector objects identified 

The resulting contour plot is displayed in Fig. [TSj which by applying standard jet algorithms to the calorimeter 

illustrates the relevant points. The 0.001 level bound- cells of typical hadron collider detectors, allowing a uni- 

aries for single photons (green) and photon-jets (blue) versal language for such objects. We have demonstrated 

are now somewhat better separated, although the effec- the efhcacy of employing the recent developments in jet 

lively one-photon-jets from PJSP 3 (when one of the pho- substructure techniques to separate and identify these 
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FIG. 18. The BDT responses of QCD-jets(red), single pho- 
tons(green) and photon-jets(blue) for photon-jets at PJSP 3 
using the full set D of variables on the horizontal axis. 



detector objects in terms of physics objects. Continuing 
the efforts begun in Ref. [? ], we have focused on iden- 
tifying three specific physics objects, the familiar single 
photons, QCD-jets and the Beyond the Standard Model 
(and LHC) relevant photon-jets. In particular, we have 
demonstrated that it is possible to achieve significant sep- 
aration between photon-jets and their dominant back- 
grounds, i.e., single photons and QCD-jets. We expect 
that both the ATLAS and CMS groups could enhance 
their searches for signatures of new physics by adopting 
the methods described. These methods should allow the 
separation of photon-jets from single photons from QCD- 
jets, and also provide some identification of the specific 
dynamics yielding the photon-jets. 

We note that our simulation does not take into ac- 
count the impact of magnetic fields inside the detectors. 
On the other hand, one might interpret this absence of 
a magnetic field as making our results more conserva- 
tive. When the magnetic field bends the electrons and 
positrons from converted photons, this serves to generate 
more structure inside the jet. The substructure variables, 
as we have described, tend to become more powerful with 
more structure. A more detailed analysis is, however, be- 
yond the scope of this paper. 

Finally, it is worth mentioning that the formalism and 
techniques developed in this paper for photon-jets should 
work in a similar way for the case of collinear electrons, 
often labeled 'electron-jets' [55H55] . An electron-jet is 
characterized by a large number of charged tracks along 
with a small hadronic energy fraction. Also, we expect 
the electrons inside these jets to bend in a magnetic field, 
creating more substructure. Therefore we anticipate that 



multivariate analyses similar to those described here will 
be correspondingly effective at separating electron-jets 
from QCD-jets (and photon-jets). 
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Appendix A: Technical Details 

1. Conversions 

In the material of the detector photons convert into 
electron-positron pairs. Our implementation of photon 
conversion is based on the properties of the ATLAS 
detector [59 . We associate an rj dependent probability of 
conversion with every photon. This probability is a func- 
tion of the number of radiation lengths a photon passes 
through (labeled n(ri)) in order to escape the first layer 
of the pixels. We model the probability to convert using 
the expression: 

P(,7) = l-exp(^-^n(77)^ , (Al) 

where the factor of 7/9 comes from conversion between 
radiation length and mean free path. A plot of the ex- 
tracted radiation length profile of the inner pixel detector 
in the ATLAS detector that we use to determine n{ri) is 
displayed in Fig. [T9j The yellow-black dashed line shows 




i '0.0 0.5 1.0 1.5 2.0 2.5 



FIG. 19. Number of radiation length up to and through the 
layer of pixel detectors in the ATLAS detector. 
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the extracted ry-dependent number-of-radiation-lengths a 
photon needs to travel before it exits the pixel layer. If 
the photon converts before the dashed line, it is treated 
as a charged track, otherwise no charged track is included 
in our simulation. 



2. Details of the Calorimeters 

Our simulation of calorimeters closely resembles the 
calorimeters implemented in the widely used simulation 
tool PGS [5D]. The electromagnetic calorimeter ECal 
covers jTyj < 2.5 in the pseudorapidity direction, whereas 
the hadronic calorimeter HCal covers the range \ri\ < 5.0. 
Both of the calorimeters provide a full 2tt coverage in ip, 
the azimuthal direction. The ECal has granularity of 
0.025 X 0.025 in the -q-cf) plane. The HCal, on the other 
hand, has a coarser granularity with cells of size 0.1 x 0.1. 

We assume that all particles generated at the interac- 
tion point (i.e., the Pythia output), except charged par- 
ticles with energy less than 0.1 GeV, reach the ECal. 
Photon/electrons within |?7| < 2.5 deposit 99% of their 
energy in the ECal and the rest in the HCal. The ECal 
also fully absorbs hadrons with energy less than 0.5 GeV. 
For more energetic hadrons, 0.5 GeV is deposited in the 
ECal and the rest in the HCal. Photon/electron/hadrons 
that lie outside the pseudorapidity range of the ECal but 
within the range of the HCal (i.e., 5.0 > \ri\ > 2.5) are 
completely absorbed in the HCal. 

In our simulation, muons also deposit energy in the 



calorimeters. Within the range < 2.5, muons with 
E < 0.5 GeV deposit all of their energy in the ECal; 
muons with 0.5 GeV < E < 2.5 GeV deposit 0.5 GeV 
in the ECal and the rest in the HCal; muons with E > 
2.5 GeV deposit 0.5 GeV in the ECal and 2.0 GcV in the 
HCal. For muons within the rapidity range 5.0 > \ri\ > 
2.5, those with E < 2.0 GeV are fully absorbed in the 
HCal, while muons with E > 2.0 GeV deposit 2.0 GeV 
in the HCal. 



3. Moliere Matrix 

In order to simulate the transverse smearing in the 
ECal, we deposit the energy of each particle not just into 
the specific cell through which it passes, but also into 
the surrounding cells. The (rj-cf)) coordinates of a particle 
determine the cell in which it deposits most of its energy 
(call it the (i,j)-th cell in the grid). Wc distribute its 
energy in the neighboring cells according to the following 
table: 





i-1 


i 


i+l 


j-1 


0.00(4) 


0.01(4) 


0.00(4) 


j 


0.01(4) 


0.92(4) 


0.01(4) 


j+1 


0.00(4) 


0.01(4) 


0.00(4) 



We estimate these numbers by integrating the energy de- 
posited in an electromagnetic shower in a cell of size 
(0.025 X 0.025) and situated 1.5 m from the origin of 
the shower. We assume that the cell is made of lead 
(i?M(Pb) = 1.6 cm) . 
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